Overview

Dataset info

Number of variables: 27
Number of observations: 500137
Missing cells: 274972 (2.0%)
Duplicate rows: 0 (0.0%)
Total size in memory: 396.0 MiB
Average record size in memory: 830.3 B
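
The missing-cell figure can be re-derived from the per-variable "Missing" counts reported in the Variables section. The sketch below only copies those counts into a dictionary (variables with no missing values are omitted) and redoes the arithmetic; it is a consistency check, not the profiler's own code.

```python
# Per-variable "Missing" counts copied from the Variables section of this report.
missing_by_variable = {
    "FIRST_TIME_HOMEBUYER_FLAG": 130559,
    "METROPOLITAN_STATISTICAL_AREA": 70149,
    "MORTGAGE_INSURANCE_PERCENTAGE": 51048,
    "ORIGINAL_DEBT_TO_INCOME_RATIO": 14929,
    "PREPAYMENT_PENALTY_MORTGAGE_FLAG": 5178,
    "CREDIT_SCORE": 2711,
    "NUMBER_OF_BORROWERS": 247,
    "PROPERTY_TYPE": 95,
    "POSTAL_CODE": 31,
    "ORIGINAL_COMBINED_LOAN_TO_VALUE": 13,
    "ORIGINAL_LOAN_TO_VALUE": 9,
    "NUMBER_OF_UNITS": 3,
}
n_variables = 27
n_observations = 500137

total_cells = n_variables * n_observations
missing_cells = sum(missing_by_variable.values())
missing_pct = 100 * missing_cells / total_cells

print(missing_cells)          # 274972
print(round(missing_pct, 1))  # 2.0
```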

Variable types

NUM: 12
CAT: 11
BOOL: 4

Reproduction info

Date of analysis: 2020-05-04 01:37:24.802971
Version: pandas-profiling v2.4.0
Command line: pandas_profiling --config_file config.yaml [YOUR_FILE.csv]
Download configuration: config.yaml

Warnings

FIRST_TIME_HOMEBUYER_FLAG has 130559 (26.1%) missing values [Missing]
LOAN_SEQUENCE_NUMBER has a high cardinality: 500137 distinct values [Warning]
METROPOLITAN_STATISTICAL_AREA has 70149 (14.0%) missing values [Missing]
MORTGAGE_INSURANCE_PERCENTAGE has 51048 (10.2%) missing values [Missing]
MORTGAGE_INSURANCE_PERCENTAGE has 309979 (62.0%) zeros [Zeros]
ORIGINAL_DEBT_TO_INCOME_RATIO has 14929 (3.0%) missing values [Missing]
PREPAYMENT_PENALTY_MORTGAGE_FLAG has 5178 (1.0%) missing values [Missing]
PRODUCT_TYPE has constant value "FRM" [Rejected]
PROPERTY_STATE has a high cardinality: 53 distinct values [Warning]
MATURITY_DATE is highly correlated with FIRST_PAYMENT_DATE [High Correlation]
FIRST_PAYMENT_DATE is highly correlated with MATURITY_DATE [High Correlation]
ORIGINAL_LOAN_TO_VALUE is highly correlated with ORIGINAL_COMBINED_LOAN_TO_VALUE [High Correlation]
ORIGINAL_COMBINED_LOAN_TO_VALUE is highly correlated with ORIGINAL_LOAN_TO_VALUE [High Correlation]
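
Warnings of the missing, zeros, constant and high-cardinality kind can be approximated with plain pandas. The function below is a minimal sketch, not pandas-profiling's implementation: `basic_warnings` is a hypothetical helper and all three thresholds are assumptions. They were picked only to be consistent with this report, where PROPERTY_STATE (53 distinct values) is flagged but SELLER_NAME (48) is not, and a 1.0% missing share is flagged but 0.5% is not.

```python
import pandas as pd
from pandas.api.types import is_bool_dtype, is_numeric_dtype


def basic_warnings(df, missing_min=0.01, zeros_min=0.5, cardinality_min=50):
    """Return (column, tag) pairs loosely mirroring the report's warning tags.

    All thresholds are illustrative assumptions, not the profiler's settings.
    """
    found = []
    for col in df.columns:
        s = df[col]
        numeric = is_numeric_dtype(s) and not is_bool_dtype(s)
        distinct = s.nunique(dropna=True)
        if s.isna().mean() >= missing_min:
            found.append((col, "Missing"))
        if distinct == 1 and not s.isna().any():
            found.append((col, "Rejected (constant)"))
        elif not numeric and distinct > cardinality_min:
            found.append((col, "High cardinality"))
        if numeric and (s == 0).mean() >= zeros_min:
            found.append((col, "Zeros"))
    return found


# Toy data for illustration only; these are not rows from the dataset.
toy = pd.DataFrame({
    "PRODUCT_TYPE": ["FRM"] * 4,
    "MORTGAGE_INSURANCE_PERCENTAGE": [0.0, 0.0, 30.0, None],
    "CREDIT_SCORE": [669, 732, 679, 721],
})
result = basic_warnings(toy)
for column, tag in result:
    print(column, tag)
```

On the toy frame this reports PRODUCT_TYPE as constant and MORTGAGE_INSURANCE_PERCENTAGE as both missing and zero-heavy, the same combination of tags those two columns carry in the list above.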

Variables

CHANNEL
Categorical

Distinct count: 4
Unique (%): < 0.1%
Missing: 0
Missing (%): 0.0%
Memory size: 3.8 MiB

Value  Count   Frequency (%)
T      280054  56.0%
R      219648  43.9%
C      272     0.1%
B      163     < 0.1%

Composition

Contains chars: True
Contains digits: False
Contains whitespace: False
Contains non-words: False

Length

Max length: 1
Mean length: 1
Min length: 1

CREDIT_SCORE
Real number (ℝ≥0)

Distinct count: 391
Unique (%): 0.1%
Missing: 2711
Missing (%): 0.5%
Infinite: 0
Infinite (%): 0.0%
Mean: 712.5362124
Minimum: 300
Maximum: 839
Zeros: 0
Zeros (%): 0.0%
Memory size: 3.8 MiB

Quantile statistics

Minimum: 300
5-th percentile: 620
Q1: 676
median: 719
Q3: 756
95-th percentile: 788
Maximum: 839
Range: 539
Interquartile range (IQR): 80

Descriptive statistics

Standard deviation: 54.79126197
Coefficient of variation (CV): 0.0768961086
Kurtosis: 2.801586688
Mean: 712.5362124
Median Absolute Deviation (MAD): 44.36682138
Skewness: -0.891283879
Sum: 354434038
Variance: 3002.082389

Histogram with fixed size bins (bins=10)

Common values

Value               Count   Frequency (%)
748                 3881    0.8%
756                 3870    0.8%
754                 3826    0.8%
766                 3772    0.8%
764                 3757    0.8%
747                 3757    0.8%
734                 3749    0.7%
760                 3674    0.7%
753                 3664    0.7%
745                 3629    0.7%
Other values (380)  459847  91.9%

Minimum 5 values

Value  Count  Frequency (%)
300    511    0.1%
333    1      < 0.1%
359    1      < 0.1%
363    1      < 0.1%
366    1      < 0.1%

Maximum 5 values

Value  Count  Frequency (%)
839    1      < 0.1%
838    2      < 0.1%
837    2      < 0.1%
835    1      < 0.1%
832    1      < 0.1%
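
Several of the CREDIT_SCORE statistics are simple functions of the others. The sketch below recomputes them from the values printed in this section, as a way to read the table rather than as the profiler's code. It assumes the mean is taken over non-missing observations only, which the reported Sum supports.

```python
# Values copied from the CREDIT_SCORE section of this report.
n_observations, missing = 500137, 2711
minimum, maximum = 300, 839
q1, q3 = 676, 756
mean, std = 712.5362124, 54.79126197

value_range = maximum - minimum               # Range
iqr = q3 - q1                                 # Interquartile range (IQR)
cv = std / mean                               # Coefficient of variation (CV)
variance = std ** 2                           # Variance
total = mean * (n_observations - missing)     # Sum over non-missing values

print(value_range)         # 539
print(iqr)                 # 80
print(round(cv, 6))        # 0.076896
print(round(variance, 2))  # 3002.08
print(round(total))        # 354434038
```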
 

DELINQUENT
Boolean

Distinct count: 2
Unique (%): < 0.1%
Missing: 0
Missing (%): 0.0%
Memory size: 488.5 KiB

Value  Count   Frequency (%)
False  482146  96.4%
True   17991   3.6%

FIRST_PAYMENT_DATE
Real number (ℝ≥0)

HIGH CORRELATION
Distinct count: 73
Unique (%): < 0.1%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 200025.431
Minimum: 199901
Maximum: 201103
Zeros: 0
Zeros (%): 0.0%
Memory size: 3.8 MiB

Quantile statistics

Minimum: 199901
5-th percentile: 199903
Q1: 199904
median: 200005
Q3: 200105
95-th percentile: 200203
Maximum: 201103
Range: 1202
Interquartile range (IQR): 201

Descriptive statistics

Standard deviation: 109.8155414
Coefficient of variation (CV): 0.0005490078981
Kurtosis: -1.400368035
Mean: 200025.431
Median Absolute Deviation (MAD): 100.6621303
Skewness: 0.1487456108
Sum: 1.00040119e+11
Variance: 12059.45314

Histogram with fixed size bins (bins=10)
Histogram with variable size bins (bins=[199901. 199901.5 199902.5 199904.5 199905.5 ... 200309.5 200311. 200401.5 200408. 201103. ], "bayesian blocks" binning strategy used)

Common values

Value              Count  Frequency (%)
200105             72893  14.6%
199905             67536  13.5%
199903             62459  12.5%
199904             62279  12.5%
200104             57531  11.5%
200203             44289  8.9%
200103             40800  8.2%
200005             23578  4.7%
200004             20218  4.0%
200003             19070  3.8%
Other values (63)  29484  5.9%

Minimum 5 values

Value   Count  Frequency (%)
199901  8      < 0.1%
199902  1473   0.3%
199903  62459  12.5%
199904  62279  12.5%
199905  67536  13.5%

Maximum 5 values

Value   Count  Frequency (%)
201103  1      < 0.1%
200701  1      < 0.1%
200604  1      < 0.1%
200505  2      < 0.1%
200503  2      < 0.1%

FIRST_TIME_HOMEBUYER_FLAG
Boolean

MISSING
Distinct count: 3
Unique (%): < 0.1%
Missing: 130559
Missing (%): 26.1%
Memory size: 3.8 MiB

Value      Count   Frequency (%)
N          320418  64.1%
Y          49160   9.8%
(Missing)  130559  26.1%
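
A frequency table with an explicit "(Missing)" row, like the one above, can be produced with `value_counts(dropna=False)`. This is a minimal sketch; `frequency_table` is a hypothetical helper, the input is toy data, and unlike the report (which lists the missing row last) it simply orders rows by count.

```python
import pandas as pd


def frequency_table(series: pd.Series) -> pd.DataFrame:
    """Count each value, including missing ones, as a share of all rows."""
    counts = series.value_counts(dropna=False)
    labels = ["(Missing)" if pd.isna(v) else v for v in counts.index]
    return pd.DataFrame(
        {
            "Count": counts.to_numpy(),
            "Frequency (%)": (100 * counts.to_numpy() / len(series)).round(1),
        },
        index=labels,
    )


# Toy flags for illustration only; not rows from the dataset.
flags = pd.Series(["N", "N", "Y", None, "N", None, "N", "N"])
table = frequency_table(flags)
print(table)
```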
 

LOAN_PURPOSE
Categorical

Distinct count: 3
Unique (%): < 0.1%
Missing: 0
Missing (%): 0.0%
Memory size: 3.8 MiB

Value  Count   Frequency (%)
P      214791  42.9%
N      174293  34.8%
C      111053  22.2%

Composition

Contains chars: True
Contains digits: False
Contains whitespace: False
Contains non-words: False

Length

Max length: 1
Mean length: 1
Min length: 1

LOAN_SEQUENCE_NUMBER
Categorical

UNIQUE
HIGH CARDINALITY
Distinct count: 500137
Unique (%): 100.0%
Missing: 0
Missing (%): 0.0%
Memory size: 3.8 MiB

Value                  Count   Frequency (%)
F101Q1123130           1       < 0.1%
F199Q1169655           1       < 0.1%
F199Q1044484           1       < 0.1%
F199Q1317097           1       < 0.1%
F100Q1041580           1       < 0.1%
F199Q1161220           1       < 0.1%
F199Q1335685           1       < 0.1%
F102Q1091349           1       < 0.1%
F101Q1206245           1       < 0.1%
F101Q1170643           1       < 0.1%
Other values (500127)  500127  > 99.9%

Composition

Contains chars: True
Contains digits: True
Contains whitespace: False
Contains non-words: False

Length

Max length: 12
Mean length: 12
Min length: 12

MATURITY_DATE
Real number (ℝ≥0)

HIGH CORRELATION
Distinct count: 122
Unique (%): < 0.1%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 203023.1959
Minimum: 202402
Maximum: 204101
Zeros: 0
Zeros (%): 0.0%
Memory size: 3.8 MiB

Quantile statistics

Minimum: 202402
5-th percentile: 202902
Q1: 202903
median: 203004
Q3: 203104
95-th percentile: 203202
Maximum: 204101
Range: 1699
Interquartile range (IQR): 201

Descriptive statistics

Standard deviation: 110.3841886
Coefficient of variation (CV): 0.0005437023493
Kurtosis: -1.163418093
Mean: 203023.1959
Median Absolute Deviation (MAD): 100.6100265
Skewness: 0.09022322708
Sum: 1.015394121e+11
Variance: 12184.66908

Histogram with fixed size bins (bins=10)
Histogram with variable size bins (bins=[202402. 202408.5 202411.5 202501.5 202504.5 ... 203211.5 203301.5 203304.5 203558. 204101. ], "bayesian blocks" binning strategy used)

Common values

Value               Count  Frequency (%)
203104              72734  14.5%
202904              67591  13.5%
202902              62308  12.5%
202903              62161  12.4%
203103              57136  11.4%
203202              44254  8.8%
203102              40558  8.1%
203004              24252  4.8%
203003              20652  4.1%
203002              19382  3.9%
Other values (112)  29109  5.8%

Minimum 5 values

Value   Count  Frequency (%)
202402  1      < 0.1%
202403  2      < 0.1%
202404  2      < 0.1%
202405  4      < 0.1%
202406  2      < 0.1%

Maximum 5 values

Value   Count  Frequency (%)
204101  1      < 0.1%
203612  1      < 0.1%
203504  1      < 0.1%
203502  2      < 0.1%
203406  1      < 0.1%
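
The values of FIRST_PAYMENT_DATE and MATURITY_DATE look like YYYYMM integers (199903, 203104), so statistics such as Range and IQR are computed on the encoded integers rather than on elapsed time. Assuming that encoding, the sketch below converts such values to a month count and measures the gap between the two dates for loans shown in the Sample section. The helper names are hypothetical.

```python
def yyyymm_to_month_index(value: int) -> int:
    """Convert an integer such as 199903 (assumed YYYYMM) to a running month count."""
    year, month = divmod(int(value), 100)
    if not 1 <= month <= 12:
        raise ValueError(f"not a YYYYMM value: {value}")
    return year * 12 + (month - 1)


def months_between(first_payment: int, maturity: int) -> int:
    """Number of months from the first payment date to the maturity date."""
    return yyyymm_to_month_index(maturity) - yyyymm_to_month_index(first_payment)


# Sample row 1: first payment 199904, maturity 202903, ORIGINAL_LOAN_TERM 360.
print(months_between(199904, 202903))  # 359
# Sample row 0: first payment 200206, maturity 202901, ORIGINAL_LOAN_TERM 320.
print(months_between(200206, 202901))  # 319
# FIRST_PAYMENT_DATE minimum to maximum: the reported Range is 1202,
# but the elapsed time is 146 months.
print(months_between(199901, 201103))  # 146
```

In both sample rows the gap is one month less than ORIGINAL_LOAN_TERM, which is what one would expect if the term counts payments from the first to the last inclusive. A near-constant offset of this kind would also account for the high correlation reported between the two date columns.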
 

METROPOLITAN_STATISTICAL_AREA
Real number (ℝ≥0)

MISSING
Distinct count: 391
Unique (%): 0.1%
Missing: 70149
Missing (%): 14.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 30777.82474
Minimum: 10180
Maximum: 49740
Zeros: 0
Zeros (%): 0.0%
Memory size: 3.8 MiB

Quantile statistics

Minimum: 10180
5-th percentile: 12420
Q1: 19740
median: 33340
Q3: 40420
95-th percentile: 47644
Maximum: 49740
Range: 39560
Interquartile range (IQR): 20680

Descriptive statistics

Standard deviation: 11333.40114
Coefficient of variation (CV): 0.3682326883
Kurtosis: -1.267260635
Mean: 30777.82474
Median Absolute Deviation (MAD): 9961.585815
Skewness: -0.2019569847
Sum: 1.32340953e+10
Variance: 128445981.5

Histogram with fixed size bins (bins=10)

Common values

Value               Count   Frequency (%)
16974               17051   3.4%
31084               12933   2.6%
12060               11603   2.3%
38060               11039   2.2%
33460               10773   2.2%
19740               9820    2.0%
47644               9763    2.0%
47894               8595    1.7%
42044               7788    1.6%
41740               7376    1.5%
Other values (380)  323247  64.6%
(Missing)           70149   14.0%

Minimum 5 values

Value  Count  Frequency (%)
10180  49     < 0.1%
10380  1      < 0.1%
10420  1193   0.2%
10500  93     < 0.1%
10580  477    0.1%

Maximum 5 values

Value  Count  Frequency (%)
49740  184    < 0.1%
49700  196    < 0.1%
49660  515    0.1%
49620  704    0.1%
49420  327    0.1%

MORTGAGE_INSURANCE_PERCENTAGE
Real number (ℝ≥0)

MISSING
ZEROS
Distinct count: 41
Unique (%): < 0.1%
Missing: 51048
Missing (%): 10.2%
Infinite: 0
Infinite (%): 0.0%
Mean: 7.744531708
Minimum: 0
Maximum: 55
Zeros: 309979
Zeros (%): 62.0%
Memory size: 3.8 MiB

Quantile statistics

Minimum: 0
5-th percentile: 0
Q1: 0
median: 0
Q3: 18
95-th percentile: 30
Maximum: 55
Range: 55
Interquartile range (IQR): 18

Descriptive statistics

Standard deviation: 12.04654597
Coefficient of variation (CV): 1.555490561
Kurtosis: -0.7749831851
Mean: 7.744531708
Median Absolute Deviation (MAD): 10.69273212
Skewness: 1.025832107
Sum: 3477984
Variance: 145.1192698

Histogram with fixed size bins (bins=10)

Common values

Value              Count   Frequency (%)
0                  309979  62.0%
30                 53985   10.8%
25                 53585   10.7%
12                 16365   3.3%
17                 5397    1.1%
18                 4279    0.9%
35                 1518    0.3%
20                 722     0.1%
36                 556     0.1%
29                 509     0.1%
Other values (30)  2194    0.4%
(Missing)          51048   10.2%

Minimum 5 values

Value  Count   Frequency (%)
0      309979  62.0%
1      6       < 0.1%
5      1       < 0.1%
6      177     < 0.1%
8      1       < 0.1%

Maximum 5 values

Value  Count  Frequency (%)
55     2      < 0.1%
53     1      < 0.1%
52     2      < 0.1%
50     2      < 0.1%
47     2      < 0.1%
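
The Zeros (%) and Missing (%) figures for MORTGAGE_INSURANCE_PERCENTAGE are both shares of all 500137 observations. The arithmetic below, using only numbers from this section, shows the share of zeros among the non-missing values, which is the population the median and other quantiles are computed over.

```python
# Counts copied from the MORTGAGE_INSURANCE_PERCENTAGE section of this report.
n_observations = 500137
missing = 51048
zeros = 309979

non_missing = n_observations - missing
zeros_pct_of_all = 100 * zeros / n_observations
zeros_pct_of_non_missing = 100 * zeros / non_missing

print(non_missing)                         # 449089
print(round(zeros_pct_of_all, 1))          # 62.0
print(round(zeros_pct_of_non_missing, 1))  # 69.0
```

With about 69% of the non-missing values equal to zero, a median of 0 and a Q1 of 0 are expected.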
 

NUMBER_OF_BORROWERS
Categorical

Distinct count: 3
Unique (%): < 0.1%
Missing: 247
Missing (%): < 0.1%
Memory size: 3.8 MiB

Value      Count   Frequency (%)
2          315078  63.0%
1          184812  37.0%
(Missing)  247     < 0.1%

Composition

Contains chars: True
Contains digits: True
Contains whitespace: False
Contains non-words: True

Length

Max length: 3
Mean length: 3
Min length: 3

NUMBER_OF_UNITS
Categorical

Distinct count: 5
Unique (%): < 0.1%
Missing: 3
Missing (%): < 0.1%
Memory size: 3.8 MiB

Value      Count   Frequency (%)
1          489352  97.8%
2          8359    1.7%
4          1244    0.2%
3          1179    0.2%
(Missing)  3       < 0.1%

Composition

Contains chars: True
Contains digits: True
Contains whitespace: False
Contains non-words: True

Length

Max length: 3
Mean length: 3
Min length: 3

OCCUPANCY_STATUS
Categorical

Distinct count: 3
Unique (%): < 0.1%
Missing: 0
Missing (%): 0.0%
Memory size: 3.8 MiB

Value  Count   Frequency (%)
O      465817  93.1%
I      20109   4.0%
S      14211   2.8%

Composition

Contains chars: True
Contains digits: False
Contains whitespace: False
Contains non-words: False

Length

Max length: 1
Mean length: 1
Min length: 1

ORIGINAL_COMBINED_LOAN_TO_VALUE
Real number (ℝ≥0)

HIGH CORRELATION
Distinct count: 116
Unique (%): < 0.1%
Missing: 13
Missing (%): < 0.1%
Infinite: 0
Infinite (%): 0.0%
Mean: 76.05357071
Minimum: 6
Maximum: 180
Zeros: 0
Zeros (%): 0.0%
Memory size: 3.8 MiB

Quantile statistics

Minimum: 6
5-th percentile: 45
Q1: 70
median: 80
Q3: 88
95-th percentile: 95
Maximum: 180
Range: 174
Interquartile range (IQR): 18

Descriptive statistics

Standard deviation: 15.13998605
Coefficient of variation (CV): 0.1990700227
Kurtosis: 1.458289543
Mean: 76.05357071
Median Absolute Deviation (MAD): 11.27574618
Skewness: -1.117783997
Sum: 38036216
Variance: 229.2191775

Histogram with fixed size bins (bins=10)

Common values

Value               Count   Frequency (%)
80                  112011  22.4%
95                  55449   11.1%
90                  43065   8.6%
75                  26192   5.2%
79                  14931   3.0%
78                  11732   2.3%
77                  10213   2.0%
74                  10111   2.0%
70                  10104   2.0%
85                  9271    1.9%
Other values (105)  197045  39.4%

Minimum 5 values

Value  Count  Frequency (%)
6      12     < 0.1%
7      19     < 0.1%
8      35     < 0.1%
9      29     < 0.1%
10     43     < 0.1%

Maximum 5 values

Value  Count  Frequency (%)
180    1      < 0.1%
175    1      < 0.1%
160    12     < 0.1%
159    1      < 0.1%
156    1      < 0.1%

ORIGINAL_DEBT_TO_INCOME_RATIO
Real number (ℝ≥0)

MISSING
Distinct count: 66
Unique (%): < 0.1%
Missing: 14929
Missing (%): 3.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 32.91754052
Minimum: 1
Maximum: 65
Zeros: 0
Zeros (%): 0.0%
Memory size: 3.8 MiB

Quantile statistics

Minimum: 1
5-th percentile: 15
Q1: 25
median: 33
Q3: 41
95-th percentile: 51
Maximum: 65
Range: 64
Interquartile range (IQR): 16

Descriptive statistics

Standard deviation: 11.11179999
Coefficient of variation (CV): 0.3375647093
Kurtosis: -0.2547628592
Mean: 32.91754052
Median Absolute Deviation (MAD): 8.974398479
Skewness: 0.06650457841
Sum: 15971854
Variance: 123.4720991

Histogram with fixed size bins (bins=10)

Common values

Value              Count   Frequency (%)
28                 20186   4.0%
36                 16692   3.3%
33                 16398   3.3%
35                 16323   3.3%
34                 16292   3.3%
32                 16021   3.2%
37                 15988   3.2%
31                 15783   3.2%
38                 15714   3.1%
30                 15576   3.1%
Other values (55)  320235  64.0%

Minimum 5 values

Value  Count  Frequency (%)
1      117    < 0.1%
2      216    < 0.1%
3      363    0.1%
4      447    0.1%
5      574    0.1%

Maximum 5 values

Value  Count  Frequency (%)
65     524    0.1%
64     572    0.1%
63     675    0.1%
62     662    0.1%
61     768    0.2%

ORIGINAL_INTEREST_RATE
Real number (ℝ≥0)

Distinct count: 472
Unique (%): 0.1%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 7.182686864
Minimum: 4.625
Maximum: 11.5
Zeros: 0
Zeros (%): 0.0%
Memory size: 3.8 MiB

Quantile statistics

Minimum: 4.625
5-th percentile: 6.5
Q1: 6.875
median: 7
Q3: 7.375
95-th percentile: 8.49
Maximum: 11.5
Range: 6.875
Interquartile range (IQR): 0.5

Descriptive statistics

Standard deviation: 0.5799408624
Coefficient of variation (CV): 0.08074149318
Kurtosis: 1.498757734
Mean: 7.182686864
Median Absolute Deviation (MAD): 0.4382486878
Skewness: 1.224618691
Sum: 3592327.46
Variance: 0.3363314039

Histogram with fixed size bins (bins=10)
Histogram with variable size bins (bins=[ 4.625 4.9375 5.4 5.4975 5.51 ... 9.995 10.0625 10.5625 10.8 11.5 ], "bayesian blocks" binning strategy used)

Common values

Value               Count  Frequency (%)
6.875               88342  17.7%
7                   62799  12.6%
6.75                56918  11.4%
7.125               43344  8.7%
7.25                42294  8.5%
6.625               27526  5.5%
7.375               27106  5.4%
6.5                 21847  4.4%
7.5                 17728  3.5%
8.25                12304  2.5%
Other values (462)  99929  20.0%

Minimum 5 values

Value  Count  Frequency (%)
4.625  1      < 0.1%
4.73   1      < 0.1%
4.75   2      < 0.1%
4.875  6      < 0.1%
5      20     < 0.1%

Maximum 5 values

Value   Count  Frequency (%)
11.5    1      < 0.1%
10.875  2      < 0.1%
10.85   1      < 0.1%
10.75   5      < 0.1%
10.625  8      < 0.1%

ORIGINAL_LOAN_TERM
Real number (ℝ≥0)

Distinct count: 62
Unique (%): < 0.1%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 359.8554696
Minimum: 301
Maximum: 362
Zeros: 0
Zeros (%): 0.0%
Memory size: 3.8 MiB

Quantile statistics

Minimum: 301
5-th percentile: 360
Q1: 360
median: 360
Q3: 360
95-th percentile: 360
Maximum: 362
Range: 61
Interquartile range (IQR): 0

Descriptive statistics

Standard deviation: 1.90825071
Coefficient of variation (CV): 0.005302825361
Kurtosis: 366.4589173
Mean: 359.8554696
Median Absolute Deviation (MAD): 0.2863856088
Skewness: -17.67165681
Sum: 179977035
Variance: 3.641420774

Histogram with fixed size bins (bins=10)
Histogram with variable size bins (bins=[301. 304.5 311.5 312.5 323.5 ... 356.5 358.5 359.5 360.5 362. ], "bayesian blocks" binning strategy used)

Common values

Value              Count   Frequency (%)
360                495446  99.1%
354                607     0.1%
348                405     0.1%
349                282     0.1%
336                209     < 0.1%
350                208     < 0.1%
353                204     < 0.1%
359                194     < 0.1%
351                191     < 0.1%
352                163     < 0.1%
Other values (52)  2228    0.4%

Minimum 5 values

Value  Count  Frequency (%)
301    6      < 0.1%
302    6      < 0.1%
303    4      < 0.1%
304    5      < 0.1%
305    10     < 0.1%

Maximum 5 values

Value  Count   Frequency (%)
362    1       < 0.1%
361    6       < 0.1%
360    495446  99.1%
359    194     < 0.1%
358    99      < 0.1%

ORIGINAL_LOAN_TO_VALUE
Real number (ℝ≥0)

HIGH CORRELATION
Distinct count: 96
Unique (%): < 0.1%
Missing: 9
Missing (%): < 0.1%
Infinite: 0
Infinite (%): 0.0%
Mean: 75.71071406
Minimum: 6
Maximum: 100
Zeros: 0
Zeros (%): 0.0%
Memory size: 3.8 MiB

Quantile statistics

Minimum: 6
5-th percentile: 45
Q1: 70
median: 80
Q3: 85
95-th percentile: 95
Maximum: 100
Range: 94
Interquartile range (IQR): 15

Descriptive statistics

Standard deviation: 14.93771709
Coefficient of variation (CV): 0.1972999103
Kurtosis: 1.531392305
Mean: 75.71071406
Median Absolute Deviation (MAD): 11.05492102
Skewness: -1.140481569
Sum: 37865048
Variance: 223.1353918

Histogram with fixed size bins (bins=10)

Common values

Value              Count   Frequency (%)
80                 122496  24.5%
95                 50433   10.1%
90                 37716   7.5%
75                 26517   5.3%
79                 15344   3.1%
78                 12042   2.4%
77                 10462   2.1%
74                 10223   2.0%
70                 10156   2.0%
85                 9052    1.8%
Other values (85)  195687  39.1%

Minimum 5 values

Value  Count  Frequency (%)
6      12     < 0.1%
7      19     < 0.1%
8      35     < 0.1%
9      29     < 0.1%
10     43     < 0.1%

Maximum 5 values

Value  Count  Frequency (%)
100    596    0.1%
99     11     < 0.1%
98     10     < 0.1%
97     5532   1.1%
96     125    < 0.1%

ORIGINAL_UPB
Real number (ℝ≥0)

Distinct count: 433
Unique (%): 0.1%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 136493.4848
Minimum: 8000
Maximum: 578000
Zeros: 0
Zeros (%): 0.0%
Memory size: 3.8 MiB

Quantile statistics

Minimum: 8000
5-th percentile: 52000
Q1: 89000
median: 126000
Q3: 176000
95-th percentile: 250000
Maximum: 578000
Range: 570000
Interquartile range (IQR): 87000

Descriptive statistics

Standard deviation: 60968.74307
Coefficient of variation (CV): 0.4466787786
Kurtosis: -0.2172889233
Mean: 136493.4848
Median Absolute Deviation (MAD): 49931.7113
Skewness: 0.5810446598
Sum: 6.8265442e+10
Variance: 3717187631

Histogram with fixed size bins (bins=10)
Histogram with variable size bins (bins=[ 8000. 12500. 16500. 19500. 20500. ... 424500. 426000. 458500. 462500. 578000.], "bayesian blocks" binning strategy used)

Common values

Value               Count   Frequency (%)
100000              9471    1.9%
275000              7859    1.6%
240000              7655    1.5%
120000              6162    1.2%
150000              5964    1.2%
80000               5628    1.1%
200000              5542    1.1%
90000               5483    1.1%
140000              5145    1.0%
110000              4987    1.0%
Other values (423)  436241  87.2%

Minimum 5 values

Value  Count  Frequency (%)
8000   1      < 0.1%
9000   1      < 0.1%
10000  7      < 0.1%
11000  3      < 0.1%
12000  3      < 0.1%

Maximum 5 values

Value   Count  Frequency (%)
578000  1      < 0.1%
560000  2      < 0.1%
544000  1      < 0.1%
529000  4      < 0.1%
525000  1      < 0.1%

POSTAL_CODE
Real number (ℝ≥0)

Distinct count: 893
Unique (%): 0.2%
Missing: 31
Missing (%): < 0.1%
Infinite: 0
Infinite (%): 0.0%
Mean: 55490.85714
Minimum: 600
Maximum: 99900
Zeros: 0
Zeros (%): 0.0%
Memory size: 3.8 MiB

Quantile statistics

Minimum: 600
5-th percentile: 7000
Q1: 30500
median: 54200
Q3: 85000
95-th percentile: 97100
Maximum: 99900
Range: 99300
Interquartile range (IQR): 54500

Descriptive statistics

Standard deviation: 29505.38226
Coefficient of variation (CV): 0.5317161021
Kurtosis: -1.240880713
Mean: 55490.85714
Median Absolute Deviation (MAD): 25685.62965
Skewness: -0.0841622712
Sum: 2.77513106e+10
Variance: 870567582.2

Histogram with fixed size bins (bins=10)

Common values

Value               Count   Frequency (%)
94500               7240    1.4%
85200               5775    1.2%
30000               5733    1.1%
48100               5286    1.1%
60000               5161    1.0%
48000               4347    0.9%
92600               4305    0.9%
60100               4295    0.9%
60600               4071    0.8%
75000               4035    0.8%
Other values (882)  449858  89.9%

Minimum 5 values

Value  Count  Frequency (%)
600    159    < 0.1%
700    219    < 0.1%
900    466    0.1%
1000   346    0.1%
1100   69     < 0.1%

Maximum 5 values

Value  Count  Frequency (%)
99900  17     < 0.1%
99800  68     < 0.1%
99700  42     < 0.1%
99600  139    < 0.1%
99500  461    0.1%

PREPAID
Boolean

Distinct count: 2
Unique (%): < 0.1%
Missing: 0
Missing (%): 0.0%
Memory size: 488.5 KiB

Value  Count   Frequency (%)
True   480724  96.1%
False  19413   3.9%

PREPAYMENT_PENALTY_MORTGAGE_FLAG
Boolean

MISSING
Distinct count: 3
Unique (%): < 0.1%
Missing: 5178
Missing (%): 1.0%
Memory size: 3.8 MiB

Value      Count   Frequency (%)
N          492669  98.5%
Y          2290    0.5%
(Missing)  5178    1.0%

PRODUCT_TYPE
Categorical

CONST
Distinct count: 1
Unique (%): < 0.1%
Missing: 0
Missing (%): 0.0%
Memory size: 3.8 MiB

Value  Count   Frequency (%)
FRM    500137  100.0%

Composition

Contains chars: True
Contains digits: False
Contains whitespace: False
Contains non-words: False

Length

Max length: 3
Mean length: 3
Min length: 3

PROPERTY_STATE
Categorical

HIGH CARDINALITY
Distinct count: 53
Unique (%): < 0.1%
Missing: 0
Missing (%): 0.0%
Memory size: 3.8 MiB

Value              Count   Frequency (%)
CA                 72566   14.5%
FL                 30088   6.0%
MI                 26956   5.4%
IL                 26175   5.2%
TX                 22786   4.6%
OH                 20334   4.1%
CO                 17943   3.6%
GA                 16490   3.3%
AZ                 16072   3.2%
NC                 16052   3.2%
Other values (43)  234675  46.9%

Composition

Contains chars: True
Contains digits: False
Contains whitespace: False
Contains non-words: False

Length

Max length: 2
Mean length: 2
Min length: 2

PROPERTY_TYPE
Categorical

Distinct count: 7
Unique (%): < 0.1%
Missing: 95
Missing (%): < 0.1%
Memory size: 3.8 MiB

Value      Count   Frequency (%)
SF         410630  82.1%
PU         53455   10.7%
CO         33639   6.7%
MH         1741    0.3%
CP         380     0.1%
LH         197     < 0.1%
(Missing)  95      < 0.1%

Composition

Contains chars: True
Contains digits: False
Contains whitespace: False
Contains non-words: False

Length

Max length: 3
Mean length: 2.000189948
Min length: 2

SELLER_NAME
Categorical

Distinct count: 48
Unique (%): < 0.1%
Missing: 0
Missing (%): 0.0%
Memory size: 3.8 MiB

Value                 Count   Frequency (%)
Other sellers         109360  21.9%
WELLSFARGOHOMEMORTGA  63768   12.8%
ABNAMROMTGEGROUP,INC  50543   10.1%
NORWEST MORTGAGE, IN  23080   4.6%
BANKOFAMERICA,NA      21064   4.2%
NATLCITYMTGECO        18303   3.7%
COUNTRYWIDE HOME LOA  17416   3.5%
NORWESTMORTGAGE,INC   17248   3.4%
PRINCIPALRESIDENTIAL  13603   2.7%
STANDARD FEDERAL BAN  11591   2.3%
Other values (38)     154161  30.8%

Composition

Contains chars: True
Contains digits: False
Contains whitespace: True
Contains non-words: True

Length

Max length: 20
Mean length: 17.60952099
Min length: 8

SERVICER_NAME
Categorical

Distinct count: 26
Unique (%): < 0.1%
Missing: 0
Missing (%): 0.0%
Memory size: 3.8 MiB

Value                 Count  Frequency (%)
Other servicers       94141  18.8%
WELLSFARGOHOMEMORTGA  86449  17.3%
BANKOFAMERICA,NA      42354  8.5%
WASHINGTONMUTUALBANK  38851  7.8%
ABNAMROMTGEGROUP,INC  38145  7.6%
CHASEMTGECO           26843  5.4%
NATLCITYMTGECO        22907  4.6%
WELLSFARGOBANK,NA     22888  4.6%
COUNTRYWIDE           18494  3.7%
PRINCIPALRESIDENTIAL  14962  3.0%
Other values (16)     94103  18.8%

Composition

Contains chars: True
Contains digits: False
Contains whitespace: True
Contains non-words: True

Length

Max length: 20
Mean length: 16.92317705
Min length: 8

Correlations
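
The correlation plots did not survive in this copy of the report, but the High Correlation warnings name two pairs of variables. A pairwise check of that kind can be sketched with `DataFrame.corr`. The helper `highly_correlated_pairs` is hypothetical, and the 0.9 cutoff and the choice of Pearson correlation are assumptions rather than values taken from the report's config.yaml. The toy frame uses the LTV, CLTV and DTI values of sample rows 0 to 5.

```python
import pandas as pd


def highly_correlated_pairs(df: pd.DataFrame, threshold: float = 0.9):
    """List numeric column pairs whose absolute Pearson correlation reaches threshold."""
    corr = df.select_dtypes("number").corr(method="pearson")
    cols = list(corr.columns)
    pairs = []
    for i, a in enumerate(cols):
        for b in cols[i + 1:]:
            r = float(corr.loc[a, b])
            if abs(r) >= threshold:
                pairs.append((a, b, r))
    return pairs


# Values from sample rows 0 to 5 of this report (First rows table).
toy = pd.DataFrame({
    "ORIGINAL_LOAN_TO_VALUE": [80.0, 25.0, 91.0, 39.0, 85.0, 73.0],
    "ORIGINAL_COMBINED_LOAN_TO_VALUE": [80.0, 25.0, 91.0, 39.0, 85.0, 73.0],
    "ORIGINAL_DEBT_TO_INCOME_RATIO": [33.0, 10.0, 48.0, 13.0, 24.0, 44.0],
})
pairs = highly_correlated_pairs(toy)
for a, b, r in pairs:
    print(a, b)
```

On these six rows only the two loan-to-value columns pass the cutoff, since their values are identical there.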

Missing values

Sample

First rows

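(index) | CHANNEL | CREDIT_SCORE | DELINQUENT | FIRST_PAYMENT_DATE | FIRST_TIME_HOMEBUYER_FLAG | LOAN_PURPOSE | LOAN_SEQUENCE_NUMBER | MATURITY_DATE | METROPOLITAN_STATISTICAL_AREA | MORTGAGE_INSURANCE_PERCENTAGE | NUMBER_OF_BORROWERS | NUMBER_OF_UNITS | OCCUPANCY_STATUS | ORIGINAL_COMBINED_LOAN_TO_VALUE | ORIGINAL_DEBT_TO_INCOME_RATIO | ORIGINAL_INTEREST_RATE | ORIGINAL_LOAN_TERM | ORIGINAL_LOAN_TO_VALUE | ORIGINAL_UPB | POSTAL_CODE | PREPAID | PREPAYMENT_PENALTY_MORTGAGE_FLAG | PRODUCT_TYPE | PROPERTY_STATE | PROPERTY_TYPE | SELLER_NAME | SERVICER_NAME
0 | R | 669.0 | False | 200206 | N | P | F199Q1000004 | 202901 | NaN | 0.0 | 2.0 | 1.0 | O | 80.0 | 33.0 | 7.120 | 320 | 80.0 | 162000 | 26100.0 | True | N | FRM | WV | SF | Other sellers | Other servicers
1 | R | 732.0 | False | 199904 | N | N | F199Q1000005 | 202903 | 17140.0 | 0.0 | 1.0 | 1.0 | O | 25.0 | 10.0 | 6.500 | 360 | 25.0 | 53000 | 45200.0 | True | N | FRM | OH | SF | Other sellers | Other servicers
2 | R | 679.0 | False | 200208 | N | P | F199Q1000007 | 202902 | 15940.0 | 30.0 | 1.0 | 1.0 | O | 91.0 | 48.0 | 6.750 | 319 | 91.0 | 133000 | 44700.0 | True | N | FRM | OH | SF | Other sellers | Other servicers
3 | T | 721.0 | False | 200209 | N | N | F199Q1000013 | 202902 | 38060.0 | 0.0 | 2.0 | 1.0 | O | 39.0 | 13.0 | 6.625 | 318 | 39.0 | 174000 | 85200.0 | True | N | FRM | AZ | SF | Other sellers | Other servicers
4 | R | 618.0 | False | 200210 | N | N | F199Q1000015 | 202902 | 10420.0 | 25.0 | 2.0 | 1.0 | O | 85.0 | 24.0 | 6.375 | 317 | 85.0 | 122000 | 44200.0 | True | N | FRM | OH | SF | Other sellers | Other servicers
5 | R | 738.0 | False | 200211 | N | P | F199Q1000016 | 202903 | 10420.0 | 0.0 | 2.0 | 1.0 | O | 73.0 | 44.0 | 6.000 | 317 | 73.0 | 218000 | 44300.0 | True | N | FRM | OH | SF | Other sellers | Other servicers
6 | R | 761.0 | False | 200211 | N | P | F199Q1000017 | 202904 | NaN | 0.0 | 2.0 | 1.0 | O | 73.0 | 31.0 | 6.375 | 318 | 73.0 | 138000 | 29500.0 | True | N | FRM | SC | PU | Other sellers | Other servicers
7 | R | 707.0 | False | 200211 | N | C | F199Q1000018 | 202903 | 33340.0 | 0.0 | 2.0 | 1.0 | O | 60.0 | 57.0 | 6.250 | 317 | 60.0 | 136000 | 53000.0 | True | N | FRM | WI | SF | Other sellers | Other servicers
8 | R | 760.0 | False | 200211 | N | N | F199Q1000019 | 202903 | 33340.0 | 0.0 | 2.0 | 1.0 | O | 63.0 | 30.0 | 6.125 | 317 | 63.0 | 79000 | 53000.0 | True | N | FRM | WI | SF | Other sellers | Other servicers
9 | R | 691.0 | False | 200302 | N | P | F199Q1000023 | 202901 | 15940.0 | 0.0 | 2.0 | 1.0 | O | 65.0 | 25.0 | 5.875 | 312 | 65.0 | 130000 | 44700.0 | True | N | FRM | OH | SF | Other sellers | Other servicers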

Last rows

(index) | CHANNEL | CREDIT_SCORE | DELINQUENT | FIRST_PAYMENT_DATE | FIRST_TIME_HOMEBUYER_FLAG | LOAN_PURPOSE | LOAN_SEQUENCE_NUMBER | MATURITY_DATE | METROPOLITAN_STATISTICAL_AREA | MORTGAGE_INSURANCE_PERCENTAGE | NUMBER_OF_BORROWERS | NUMBER_OF_UNITS | OCCUPANCY_STATUS | ORIGINAL_COMBINED_LOAN_TO_VALUE | ORIGINAL_DEBT_TO_INCOME_RATIO | ORIGINAL_INTEREST_RATE | ORIGINAL_LOAN_TERM | ORIGINAL_LOAN_TO_VALUE | ORIGINAL_UPB | POSTAL_CODE | PREPAID | PREPAYMENT_PENALTY_MORTGAGE_FLAG | PRODUCT_TYPE | PROPERTY_STATE | PROPERTY_TYPE | SELLER_NAME | SERVICER_NAME
500127 | R | 754.0 | False | 200203 | NaN | N | F102Q1125969 | 203202 | 15180.0 | 0.0 | 2.0 | 1.0 | O | 68.0 | 31.0 | 6.625 | 360 | 68.0 | 40000 | 78500.0 | True | N | FRM | TX | SF | WELLSFARGOHOMEMORTGA | WELLSFARGOBANK,NA
500128 | R | 744.0 | False | 200203 | NaN | N | F102Q1125972 | 203202 | NaN | 0.0 | 1.0 | 1.0 | O | 74.0 | 37.0 | 6.625 | 360 | 74.0 | 66000 | 75100.0 | True | N | FRM | TX | SF | WELLSFARGOHOMEMORTGA | WELLSFARGOBANK,NA
500129 | R | 722.0 | False | 200203 | NaN | N | F102Q1125977 | 203202 | 49020.0 | 0.0 | 1.0 | 1.0 | O | 89.0 | 21.0 | 6.625 | 360 | 79.0 | 78000 | 22600.0 | True | N | FRM | VA | SF | WELLSFARGOHOMEMORTGA | WELLSFARGOBANK,NA
500130 | R | 673.0 | True | 200203 | NaN | N | F102Q1125982 | 203202 | 16740.0 | 0.0 | 2.0 | 1.0 | O | 55.0 | 35.0 | 6.625 | 360 | 55.0 | 80000 | 28200.0 | False | N | FRM | NC | SF | WELLSFARGOHOMEMORTGA | WELLSFARGOBANK,NA
500131 | R | 774.0 | False | 200203 | NaN | N | F102Q1125985 | 203202 | 19380.0 | 0.0 | 2.0 | 1.0 | O | 57.0 | 15.0 | 6.625 | 360 | 57.0 | 59000 | 45400.0 | True | N | FRM | OH | SF | WELLSFARGOHOMEMORTGA | WELLSFARGOBANK,NA
500132 | R | 774.0 | False | 200203 | NaN | C | F102Q1125986 | 203202 | 33460.0 | 0.0 | 1.0 | 1.0 | O | 61.0 | 38.0 | 6.625 | 360 | 61.0 | 76000 | 55400.0 | True | N | FRM | MN | SF | WELLSFARGOHOMEMORTGA | WELLSFARGOBANK,NA
500133 | R | 689.0 | False | 200203 | NaN | N | F102Q1125989 | 203202 | 10580.0 | 0.0 | 1.0 | 1.0 | O | 70.0 | 39.0 | 6.625 | 360 | 70.0 | 70000 | 12300.0 | True | N | FRM | NY | SF | WELLSFARGOHOMEMORTGA | WELLSFARGOHOMEMORTGA
500134 | R | 798.0 | False | 200203 | NaN | C | F102Q1125990 | 203202 | 19780.0 | 0.0 | 1.0 | 1.0 | O | 56.0 | 41.0 | 6.625 | 360 | 56.0 | 65000 | 50300.0 | True | N | FRM | IA | SF | WELLSFARGOHOMEMORTGA | WELLSFARGOBANK,NA
500135 | R | 791.0 | False | 200203 | NaN | N | F102Q1125991 | 203202 | 42044.0 | 0.0 | 1.0 | 1.0 | O | 26.0 | 18.0 | 6.625 | 360 | 26.0 | 51000 | 92600.0 | True | N | FRM | CA | SF | WELLSFARGOHOMEMORTGA | WELLSFARGOBANK,NA
500136 | T | 773.0 | False | 200203 | NaN | N | F102Q1125993 | 203202 | NaN | 0.0 | 1.0 | 1.0 | O | 33.0 | 48.0 | 6.625 | 360 | 33.0 | 82000 | 33000.0 | True | N | FRM | FL | SF | WELLSFARGOHOMEMORTGA | WELLSFARGOHOMEMORTGA